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ABSTRACT 


The nature of multiple solutions 'to a. general Problem of 
Bolza in the calculus of variations is investigated. These 
multiple stationary solutions are of several distinct types , 
each of which is briefly discussed. 

Multiple stationary solutions often arise when only first 
order necessary conditions have been applied. A second order 
test to eliminate some multiple solutions of this type is developed.' 

It is shown that second order tests can be broken down into path 
tests and endpoint tests and that once the first order necessary 
conditions of the calculus of variations have been applied, the 
second order path test and the endpoint ‘test can be applied 
independently . 

Only the second order endpoint test is investigated. Direct 
application of this condition requires the ' analytical integration 
of a set of nonlinear differential equations subject to mixed 
boundary conditions. Since this is often difficult or impossible, 
an algorithm is developed for numerically implementing the second • 
order endpoint condition. A geodetics problem is solved analytically 
to illustrate the theory and demonstrates that the second order condition 
is an effective computational tool, by eliminating certain classes of 
non-optimal solutions from consideration. 


vi 



HfTSODUCTIOET • 


1.1 The Problem, of Bolza 

Since much of what is to follow depends upon an understanding 
of the -Problem of Bolza as formulated in- control notation (Vincent 
and Brusch, 1966 , pp. 4-5 ) 5 a brief statement of the problem in its 
simplest form is appropriate. 

Among the set of all continuous state functions} 

y.(t) i = lj 2 , n; t g t g t 

X O I 

and continuous control variable functions 


^(t) k = 1} 2, 


m < n 


satisfying differential equations and end-conditions of the form 


y i = f i^ y j 5 U k’ 5 = 1 , 2 , ...» n 


^ y io 5 ‘ y if 5 ~^o 5 ^f ^ ~~ ® -1) 2 5 ...j-p g2r 


+ o 


find the set which will minimize a sum of the form: 

( t, 

't 


J/ = s( y io’ y ±f > t Q , t f ) + J Lfy^t), u^t), t) dt . 


Here it is assumed that the functions f., Lj g, and t are of class C c 

t) SL 

In the above and throughout this presentation, a dot above a variable 
will be used to represent the derivative of the variable with respect 
to t, the independent variable. Likewise the subscripts o and f will 
indicate the evaluation of the variable or expression at the initial 


( 1 . 1 . 1 ) 

( 1 . 1 . 2 ) 

(1.1.3) 

(1.1.4) 

(1.1.5) 



and final value of t, respectively. For the sake of brevity, the 
range of subscripts i, j, k, and SL will be as given above and will 
not be repeated in what follows. 

Following the conventional method of Lagrange multxpJLiers 
(Bryson -and Ho, 1969)5 minimization of the augmented function 

't 


■jj 

J* = g + 1 + J f d-xyi + x^O dt 


is considered.. In the above equation and throughout this report, 
repeated subscripts will be used to signify summation. Equation (l. 
was obtained by adjoining equations (1.1.3) and (1.1.4) to relation 
(1.1.5) as follows: 

(a) multiplying relations' ( 1.1. 3) -by the variables X^(t), 
respectively, integrating from t to t f and by adding the 
sum of the integrals to egression ( 1.1.5)', 

(b) multiplying equations (1.1.4) by the parameters u and 
adding the sum of the products to expression (1.1.5). 

It is convenient to define the following functions: 


fl ( y io I y if *0* V " S + 


xgt), u k (t), t] = 


The function. H is often referred to as the Hamiltonian. With these 
definitions, equation (1.1.6) may --be written as 


/ = G + J f [-H + X^.] dt 

u 

o 


By considering small variations in the path and endpoints about a 
nominal path, it can be shown that if the functions u^.(t) an d y.(t) 


( 1 . 1 . 6 ) 
• 6 ) 


(1.1.7) 

( 1 . 1 . 8 ) 

(1.I.9) 



are a solution to the Problem of Bolza, then they must satisfy thi 
following necessary conditions (Hestenes, 1966, pp. 346-351): 


Condition I.- There exist continuous multipliers X^(t) and 
Hamiltonian function as defined by equation (1.1.8) such that: 
(l) the Euler-Lagrange equations. 


k - - 


6H 

*i 


( 1 . 1 . 10 ) 


&H z (] 

bu k 


are satisfied at every point along the path and, 
(2) the transversality conditions 

o ' 

5G 

T7“ + H =0 
ot o 
o 


( 1 . 1 . 11 ) 


(-1.1.12) 


fcG 


by. 

J 10 


- X. = 0 
10 


(1.1.13) 


ft - « 


*if 


are satisfied by the endpoints. 


Condition II. The inequality 


( 1.1. 14) 


(1.1.15) 


H[y i (t), X i (t), u£°(t), t] gH[y i (t), X.(t), u^t), t] 


(1.1.16) 



must be satisfied for all t ^ t ^ t „ and for all non-optimal 

o 1 

control functions u 110 (Weierstrass Condition) . 

Condition III. - The k by k matrix 

s = 1 , 2 ^ j k 


t = lj 2j . . . , k 


5 2 H 


bu bu, 
s t 


(1.1.17) 


must be negative semi-definite for a minimum (Legendre -Clebsch 
condition) . • 


Condition TV. A fourth necessary condition is discussed by 

Bliss (1946, pp. 226-228) in classical dependent variable notation. 

- • ^ 

He proves that the, second order variation of a sum similar to J 
must be non-negative along a stationary arc, if that arc minimizes J 
Hestenes (1966, pp. 283-286) verifies this conclusion in modern 
control notation for the Problem of Bolza with fixed endpoints. As 
developed by 'Hestenes, the fourth necessary condition- represents 
a necessary condition on the path alone; variations in the endpoints- 
are not considered. If Condition III is satisfied. Condition XV is 
usually referred to as the Jacobi Condition. A further geometric 
interpretation of this condition is presented in the following 
section . 


1.2 The Nature of Multiple Stationary Solutions 

It is helpful in understanding the nature of multiple stationary 
solutions to classify them by the circumstances pertinent to their 
occurrence . 


k 



f ixed End-point Problems - Multiple stationary solutions are 
often obtained for problems with fixed endpoints, because only the 
first three necessary conditions have been applied. Bliss (19^6, 
p. 235) has shown in' dependent variable notation that the fourth 
necessary condition of Jacobi, taken together with the first three 
necessary conditions, suitably strengthened} forms a sufficient set 
of conditions for the Problem of Bolza. 

Although analytically complex, the Jacobi condition has a 
simple geometric interpretation for" problems with one state variable. 
In this case the set of all solutions forms a one dimensional family 
of extremal arcs, y = y(t, c) , which all pass through the initial 
point as shown in Figure (l.l) . With each "arc is associated a 
particular value of c. If arcs y(t, c) and y(t, c + e) intersect in 
the limit as e goes to 0, the point of intersection is called a 
conjugate point. The locus of such intersections is called the 
discriminant locus, also shown in Figure (l.l) 



Fig. 1.1 A Family of Extremals and the Discriminant Locus 


5 


In terms .of this geometry, the Jacobi condition requires that 
an optimal trajectory contain no conjugate point. Alternatively, 
the condition requires that an optimal solution may not touch the 
discriminant locus.' Figure (l.l) shows that there are two solutions 
joining points 0 and A, one of which touches the discriminant locus 
and is therefore non-optimal. If the fourth necessary condition 
of Jacobi is applied in such cases of multiple stationary solutions, 
usually all but one of the trajectories will be shown to contain a 
point conjugate to the initial point, thus rendering them non-optimal. 

. ; Examples of this occurrence are many; The Brachistochrone 
problem with fixed endpoints graphically illustrates the idea. 
Consider the problem of a bead sliding down a wire under the 
influence of gravity alone. What should the shape of the wire be 

o 

in order to minimize the time of transit between two points in a 
vertical plane? It is well known that the solution curves are 
cycloids. However, as shown in Figure (1.2) , there are several 
different cycloids which satisfy the necessary conditions of the 



Fig. 1.2 Multiple Stationary Solutions for the Brachistochrone Problem 



calculus of variations. It can be seen that the x-axis forms the 
discriminant locus and that the points where solutions 1 and 2 touch 
the discriminant locus are conjugate -points. Since solutions 1 and 2 
violate the Jacobi condition, it is evident that solution 2 is the 
true optimum. 

In this case xu uas ueen pussiDxe ~eo axsuinguisn une true 
optimum from the candidates by applying a sufficiency condition 
pertaining to the field of extremal paths. 

Variable Endpoint Problems - Problems with variable endpoints 
require that the endpoints of the trajectories, as well as the path, 
be selected in an optimum fashion. Consider the geodesics problem- 
of trying to find the minimum distance from the origin to a given 
parabola as shown in Figure (1.3). It will be shown in Chapter 2 
, that two stationary solutions exist, viz., OA and OB. Once an 
endpoint has been selected, the problem becomes one of fixed endpoints. 



Fig. 1.3 Multiple Stationary Solutions to a Geodetics Problem 



and the path sufficiency conditions previously discussed can then 
he applied. In this case, with the endpoint, A or B, thought of 
as being fixed, it can he shown that both solutions satisfy the 
Jacobi sufficiency condition (Bolza, 1961 , pp.' 84 - 86 ).' The 
endpoints shown satisfy first order necessary conditions only, it 
is apparent that a second order endpoint condition may be useful 
in distinguishing the true optimal. A second order condition will 
be derived in Chapter 2. While the second order condition obtained 
is akin to the classical focal point condition (Bolza, 1961, pp. 104-10), 
the result is new in form and is directly applicable to the optimal- 
control problem. 

- Problems 'Requiring First and Second Order Conditions - In the 
last two sections, the necessity of using fixed endpoint path- 
sufficiency conditions and second order endpoint conditions was 
illustrated separately. It is not unusual, however, to encounter 
problems requiring the application of both the Jacobi condition and 
second order endpoint conditions to distinguish the true optimum 
from the set- of multiple stationary solutions. To illustrate this 
situation, reconsider the Brachistrochrone problem where the 
final endpoint, instead of being fixed, is required to be on curve- E 
as shown in Figure (1.4). Both trajectories OAB and OD satisfy 
second order endpoint conditions; that is, both solutions would 
represent a local minimum with respect to small variations of the 
endpoint along endpoint manifold E. As discussed before, point A 
is a conjugate point, thus violating the Jacobi condition. Trajectory 0C 
violates a second order endpoint condition; it is in fact a loeal 
maximum with respect to small variations of the endpoint along E. 



0 


A 



Fig.. 1.4. A Problem Requiring Second, Order' Conditions 

Thus through the use of both the Jacobi condition and a second order 
endpoint condition; OB is selected as -the optimal candidate. , 

Problems with Periodic Solutions - Consider a system of equations 
(1.1.3) which exhibit periodic oscillations when no control effort 
is applied. • It is not unusual for the optimal controls and adjoint 
"variables of such a system to also demonstrate periodic motion with 
'the same period; especially if the magnitude of the control is small. 

The criterion by which the solution is terminated is also often 
periodic for problems exhibiting periodic oscillations . The terminating 
or cutoff condition is obtained from the transversality conditions . 
(1.1.12)- ,( 1 . 1 . 15 ) ty eliminating the u parameters, to form a 
single relationship among the state and adjoint variables. The- 
zeros of the cutoff function then represent the terminating condition. 

As an example , consider the problem of getting the mass of a thrusting 
-harmonic oscillator to a specified height while minimizing the integral 
of the thrust with respect to time. A mass is connected in a parallel 
by a spring and dashpot to an inertial reference. The mass is capable 


CUTOFF FUNCTION 


of generating a bounded thrust in the upward direction. For simplicity 
it is assumed that the mass is constant. For small damping factors 
and null thrust, the state variables, position and velocity, the 
adjoint variables, and the cutoff function all exhibit damped, 
periodic oscillations. As shown in Figure (1.5) the cutoff condition 
is satisfied during each period. For sufficiently small thrust 
amplitudes the_ cutoff function will deviate only slightly from that 
generated for null thrust, and will be satisfied at several points. 



TIME 


Fig. 1.5 Multiple Solutions Due to a Periodic Cutoff Function 

. Each time the cutoff condition is satisfied, a potential 
optimal endpoint and a corresponding stationary solution is obtained. 
Thus in periodic systems with weak bounded control, multiple 
stationary solutions may be encountered. 



Problems with Singular Control - In cases where the Hamiltonian 


is linear in a bounded control variable , the control cannot be 
determined from the Euler -Lagrange equation (l.l.ll). In this case 
the Hamiltonian can be .written as 

H = S(y i , t)u + Q(y ± , \ ± , t) (1.2. l) 

for scalar control. S is referred to as the switching function. 

The well-known Maximum Principle for problems with bounded control 
developed by Pontryagin et al. (19.62) requires that 

u = u' when S > 0 

max 

( 1 . 2 . 2 ) 

u = u . when S < 0 
■ jnm 

However, for the case when S = 0 over a non-vanishing tame interval, 
the Maximum Principle is indeterminant , and u may take on intermediate 
values. This is the ease of singular control. Leitmann (1966, pp. 57-58) 
has pointed out that,. "While it is possible in a particular problem.... 
to rule out the possibility of [singular control], this cannot be done 
in general." 

To demonstrate the existence of multiple stationary solutions in 
the case of singular control examine the problem of minimizing 

1 p “ p 

1=2 J x i dt (1.2.3) 

subject to the constraints: 


\ = x 2 + u 

0 

11 

H* 

O 

£ 

11 

0 

( 1 . 2 . 4 ) 

*2 = “U 

*>(°) * x 20 

■>£ 

T 

11 

0 


M 





11 



This problem was first discussed by Johnson and Gibson ( 1963 ) » 

The Hamiltonian is 

H = - l 2 )u + \ ± x 2 - x^/2 

In this case S = X^ - X^. If S is identically zero for a non- 
vanishing time intervals the possibility of singular control exists: 
By taking a suitable number of time derivatives , it can be shown 
that the singular control is given by u = -x^-x^, and that the 
singular arcs are two lines x^(t) = 0 ands-^t) + 2x 2 (t) = 0. 

Figure (l .6) shows two possible stationary solutions to the 
problem starting at point A, one of which has a singular subarc. 

The first arc AB is the same for both solutions. In both solutions 
u = -1 along arc BC and then follow arc CO to the origin with 
u = +1. This is the so-called "bang -bang" solution. Alternately} 
at point B one may elect singular control, u — and proceed 
to the origin directly along arc BO. 



12 


(1.2.5) 


Fig. 1.6 Multiple Solutions Arising from Singular Control 



Unfortunately j there is no guarantee that the solution with 
singular control is minimizing or that it will always enter the 
optimal solutions even if the possibility of singular solutions 
does exist. In this case the solution with bang rbang control, 
arc ABCO, . has -an index. .of performance almost 12 . per cent larger 
than for the true optimal control which uses the singular control 
arc BO.- 

Recently, Kelly, Kopp and Moyer (1967) and Robbins (1965) have 
developed a new necessary condition- ‘for testing the optimality of 
singular subarcs. 



A SECOND ORDER ENDPOINT CONDITION 

In this section a second order condition is developed for 
va'riational problems with- variable endpoints. Among the multiple 
stationary solutions that may exist ? • the . condition provides a'test 
for distinguishing those stationary solutions which are local minimums 
with- respect to endpoint variations along the prescribed terminal 
manifold. The second order endpoint condition is related to 
sufficiency conditions of the classical calculus of variations in 
section 2.4. In section 2.5 the second order condition is illustrated 
with- an example problem. Finally? a numerical algorithm is developed 
for applying' the endpoint sufficiency condition to problems with- no 
analytic solution. 

2.1 Functional ’Relationships for the Problem of Bolza 

Arguments used in Section 1 have implied that the conditions 
for the Problem of Bolza fall into two classes: those' pertaining to 

the path and those pertaining to the endpoints. • It has been 
further argued that conditions pertaining to the endpoints can be 
considered independently from those pertaining to path. Consider 
the Problem of Bolza as expressed in Section 1.1? equations (1,1.1) 
(1.1.5). In this formulation? and for the remainder of this section? 


the controls 


are assumed to be unbounded functions of time. In 




addition, it is now assumed that the Jacobian is not equal to zero 


(M 

^ 'bu ^ 5 bu 2 J 


bH \ 

’ a v 


& (u l5 u 2 , , u^ 


i 0 


( 2 . 1 . 1 ) 


for all points (u^, u 2 , ... , u^) in the control space. The 
function H has been- previously defined in equation (1.1.8). If 
equation (2.1.1) is valid,- the implicit function theorem (Buck, 

1965, pp_. . 283-286) assures the existence of the k functional', 
relations 

\ = ^Cyi(t), x^t)] (2.1.2) 

from the k control variable Euler-Lagrange equations (l.l.ll). 

Condition (2.1.1) specifically eliminates from consideration those- 
-systems in which any state variable derivative, y as defined, in • 
equation (l. 1 . 3)3 is a linear function of any of the control variables. 

A solution to the Problem of Bolza is specified by solutions 
for the state variables y^, as well as the control variables u^., as 
functions of time. A selection of the initial time and the final 
time completes the solution. To obtain these solutions, the control 
variable Euler-Lagrange equations are first 'solved .for the control 
variables u^ as functions of the state variables y^ and the adjoint 
variables The functions f^ (1.1.3) and the L function (1.1.5) are 

now explicitly dependent only on the state variables, the adjoint 
variables, and time. 


f i = u lc (y i (' fc )> *.(t)); t] 


(2.1.3) 
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L “I'CyjCt); ^(y^t), V.(t)); t] 


(2.1.4) 


Similarly the H function becomes an explicit function of the 
•state variables, the adjoint .variables, -the adjoint variables, 
and time, alone: 

H° = H°[y^(t) ; X.(t); u^y^/t), X^t)); 't] ' (2.1.5) 

Finally, it is evident that a similar functional" relationship exists 
for the time derivatives of adjoint variables. 

In summary, once the optimal control is selected, the state 
variable differential equations (1.1.3) and the adjoint variable 
differential ■ equations ( 1.1. 10) comprise a set of 2n first order 
nonlinear differential equations in the 2n state and adjoint variables 
and time. This set of differential equations can be integrated in 
theory, yielding 

Y i = *1^* c r^ r - 1, 2, 2n (2.1.7)’ 

x ± - x ± (t, c r ). (2.1.8) 

where. the c r 's are .constants of integration. The p end conditions 
(1.1.4) and the 2n + 2 equations representing the transversality 
necessary conditions (1.1.12) - ( 1 . 1 . 15 ) comprise a set of (2n + p + 2) 
non-linear algebraic equations in the 2n constants c , the p para- 
meters u.^, the initial time t Q , and the final time t f . 

The initial values (y iQ , X. f , t f ) are specified. Hence, the 
° r ’ s may be determined as a function of these initial and/or final 


16 



values by evaluating equations (2.1.7) and (2.1.8) at either the 
initial or final point. For example, a solution for the c^'s 
would be specified by the set (y^ Q , ^ 0 > > the se ^ 

( y if’ ^if’ t f’ t c) or the Set ^ y io 5 y if 5 t o’ V7hile i-t is 

difficult to attach any physical meaning to the initial or final 

values of the Lagrange multipliers, the initial and final values of 

the state variables have an immediate physical significance. For 

this reason, the state variable endpoints have 'been selected to 

functionally represent .the constants of integration for the 

rest of this section. Thus equations (.1.7) and (2.1.8) will be 

written as 


and 


' T a (t > y io’ y if *0’ V 


X 3 * ty‘’- y io’ y if’ V V 


t < t < t 
o f 


(2.1.9) 


( 2 . 1 . 10 ) 


By substituting the functional relationships exhibited in 
equations (2.1.9) and (2.1.10) into relations (2.1.2) - (2.1.6), it 
can be seen that the functions u^, L, f^, H°, and can all be 
witten as explicit functions of the set (t, y^ Q , y if 3 ^ 0 5 ^P * 
These functional relations, together with that for the function G 
from equation (1.1.7) are summarized for reference below: 


■ "k (t ’ y io’ y if’ ‘o’ V 

1 - L <‘> >‘io’ y if> ‘o’ V 


y a = f j (t ’ y io’ y if’ ‘o’ V 


\ 

a 


- P 3 (t ’ y io’ y if’ ‘o’ 



( 2 . 1 . 11 ) 

( 2 . 1 . 12 ) 

(2.1.13) 

(2.1.14) 
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(2.1.15) 


H 0 = H°[y 3 (t. y lD , y ifj t D , t f ); X^t, y^, y. f , t & , t f ); 


"k (t > y io’ y if’ W* t] 


G - G( V y io , y lf . V t f ) 


(2.1.16) 


So that there will he no confusion as to the meaning or rne subscripts, 
note that 


y jo B y j 


t=-t 


(2.1.17) 


X. = X. 
3° 0 


t=t 


(2.1.18) 


y jf = y d 


X = X 

of 0 


t=tj 


!t=t f 


(2.1.19) 

(2.1.20) 


Using the functional relationships summarized above form the 


* 


augmented function J = J + 4 where 

& & 

J(y i0 . y if , t d , t f ) » g(y l0> . y if . t o , t f ) 


( 2 . 1 . 21 ) 


+ f [-H 0 (t, y ±0 , y. f , t 0 , t f ) 


+ \j (t ’ V y if> *0’ V y io’ y if’ V V] 


dt 


By requiring the trajectory to satisfy certain necessary conditions 
regarding path, equations ( 1.1. 10) and (l.l.ll) , the Problem of Bolza 
has been reduced to the problem of minimizing J, a function of end- 
points, subject to the >i algebraic constraints on the endpoints. 

At 
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Before proceeding with the minimization of J,’ it is appropriate 
to consider a graphical interpretation of the functional relationship 
for the state variables expressed in equation (2.1.9)* Figure 2.1 
shows a general state function y^(t, y\ Q j ^f) as a 

of time. From the figure it can he -seen that a change in the final 
state Ay^ while holding all of the other, endpoints fixed causes 
a change in yu for all values of t. Likewise a change in the 
final time At f while holding all of the other endpoints fixed causes 
a Change in the state y\ for all values of t. 

Formalizing this graphical interpretation in terms of differentials 
yields results which will he of value in the following sections. Using 
equation (2.1. 9), the differential of the state variables may he 
written as , 


dy,.(t 3 y. n , y.^, t rt? t^.) = dt + dt Q + _5 jL dt f 


'jo 5 'jf 5 - o? f' 


bt 

by. 


bt 


o - 


by t 


bt , 


fy 3o ^0O + by. f 


( 2.1 


Evaluating this expression .at t = t gives 


by. 

<V yjo- y jf’ ■‘o-V - ^±t = -jj- 


dt , 


( 2.1 


^i 

bt 


byj 

dt o + ^; 




3° 


dy do + by. 


dy 






The sum represented by the last term in the above equation can be 
separated into those products for which i ^ j and that for which 
i = j . Transposing dyu^. to the right hand side 5 equation (2.1.23) 




becomes 


T by i 
°-[ * 


bt 


by ± 

btT 


'f 


] d1 


by, 


at- + 

f . by. 


dy 


30 


30 


^i 


by 


df 


ay if + T 




f 

; Md 


by 


3f 


L ]' ^3 


'■f 


df 
i=d 


(2.1.24) 


.In the above equation the repeated subscripts on. the last term 
do not .imply summation. _ Since t, t Q j t y^ Q > and y^ f have been 
assumed to be independent , equation (2.1.24) implies that 


bt 


bt „ 


(2.1.25) 


5y i 

bt 


= 0 


(2.1.26) 


by 

00 


= 0 


(2.1.27) 


5y i 
by „ 

y df 


= 0 


If 

li£j 


(2.1.28) 


*i 


= 1 


- 


(2.1.29) 


By evaluating equation (2.1.22) at t = t and following arguments 
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similar to the ones above, it can be shown that 


bt 


bt f 


(2.1.30) 


by i- 


bt 


= 0 


f- 


(2.1.31) 




= 0 


(2.1.32) 


^i 


= 0 


o 


(2.1.33) 


>i 

*3° 


- 1 
o 

i=j 


(2.1.34) 


These identities will be useful in the proofs of necessary and 
sufficient conditions in the next section. 


2.2 Derivation of Transversality Conditions 

In determining the functional relationships in the last section, 
it was assumed that the control and adjoint variables were chosen so 
as to satisfy the Euler -Lagrange Equations (1.1. 10 ) and (l.l.ll). 
Equations (1.1.10) and (l.l.ll). are referred to as the first path 
necessary conditions. In this section the endpoint necessary 
conditions (transversality conditions) are derived assuming that 
the first necessary conditions for path are satisfied. 



The solution to the path necessary conditions determines one 
or more trajectories (see section 1.3) 5 any of which may he- expressed 
functionally as a set [y_.(t, y ±Q> y ±f , t Q , t f ) , *-..(t, y iQ , y ±f , t Q ,_ t f ) . 
ii(t) y. , y.-, t , t J] as shown in section 2.1. Once the functions 
representing one of these trajectories is substituted into the integral 
-in equation- (2 .1.21) , the integration can be performed. It is there- 
fore clear that once .the trajectory is specified, J is a function of 
only the parameters y^ Q , y , 't Q and t^.. Specifying the path’ reduces 
the problem of minimizing J to the well-known problem -of finding the 
minimum of a function of several variables subject to algebraic 
equations of constraint (Bryson and Ho, 1969) • 

It is shown in Appendix A that if the arguments of J in equation 
(2.1.21) are to satisfy the constraints and minimize J , then it is 
necessary that the partial derivatives of the auxiliary function, 
shown below, with respect to y^ Q , y , t Q , and t^ .all be equal to 
zero . 


The J function is defined by J = j + n ilr where J is given 

a jo - 

by equation (2.1.21). Using the definition of the function G from 

f n\ ^ 

equation (1.2.8), J may be functionally represented as 



( 2 . 2 . 


In the above equation it is understood that y_^, and u^ are all- 
functions of the set (t, y,^ Q > y if ? t Q , t f ) . In writing the functional 
relationship shown above, it has been assumed that the controls u^ have 


been chosen in an optimal fashion in accordance with the control variable 



Euler -Lagrange Equation (l.l.ll). This is indicated by the super- 

* 

script o on u^ and on H. - The partial .derivative of J with re spec ' 
to y. can now he written: 

^ -i r\ 



Here Leibnitz Rule (Hildebrand, 1948, P- 360) has been used for 
differentiation of an integral with respect to a parameter. Using 
the identity 


5L 

dt 


Fh^-1 

L 1 by. J 


and expanding 


bH 


2 

b y • blp by 

= X. + — i- 

1 by^bt bt by jQ 

, equation (2.2.2) may be written as 

Q 


bJ 

by. 

jo 


bG + (* f r 5H 5H 5X i bH b \ 

So 4 L 


blj: by i ^ ^ 

.bt -by J dt + 


by . bt 
jo 


jo 


r S i * 


Terms under the integral sign may be combined to give 


* 


bJ 




jo 


So 1 


s_ 

. by. 

t f JO 


- X. 
1 




5y i 

t o 


t 

! o 


+ 



bH 

by ± - bt 


) 




bH bu^ 

s s;J dt 


( 2 . 2 . 2 ) 


(2.2.3) 


(2.2.4) 


(2.2.5) 
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Note from equations (2.I.9) and ( 2.1.10) that once the optimal 
endpoints have "been selected, 


by. dy. 

- i _ ^ x 

bt dt 


bX- dX- 

and i ~ 

bt dt 


The integral tern vanishes, since equations (1.1. 3) , (1.1.10) and 
(1.1. ll) were used to generate the functional relations (2.1.9) and 
( 2 . 1 . 10 ). 

Using equations (2.1.27)? (2.1.33) and (2.1.3^-), it can be 
concluded that the sums represented by the two remaining terms not 
containing G in equation (2.2.5) reduce to a single term, - X. 

With these considerations, equation (2.2.5) reduces to 


bj 


* 


bG 


by. 


by 


o° 


- x. 

o 


* 


= o 


( 2 . 2 . 6 ) 


By taking the derivative of J with respect to y^ and using arguments 
similar to those just presented (in this case equations (2.1.32), 
(2.1.28) and (2.1.29) must be taken into account), it can be shown 
that 


* 


bj 


bG 


+ X, 


= 0 


( 2 . 2 . 7 ) 


Of 


Two more necessary conditions remain to be derived. These result 

* 

from taking the partial derivatives of J with respect no the remaining 
two variables, t Q and t f . Performing. the first of these operations 
yields 
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+ r”f r™ 

bt bt . J 4. L 


tG 


bt 
o o 


bH 

bt 




bX^ by^ 

bt o ^r 


X. 1 


dt 


i btbt - 
o 


-[- 


H + X. 




bt 


3 


( 2 . 2 . 8 ) 


• Here again Leibnitz Rule has been. used; this time- the limits of - 
integration are functions of the differentiating variable. Using 
‘the identity 


d_ 

dt 




r x . — ~ i . ' = . x. 

L X bt J . x 


^4 ’ 6X ^4 


btbt 


+ 


bt . bt 


[ 2 . 2 . 9 ) 


^)H 

and expanding — — , equation (2.2.8) may be written as 
Ob 

o 


bJ_ 

bt. 


bG + f t f r zi . 
bt^ • • ■ L by. bt^ • 


bH by. bH bX^ bH bu^. 


bX. bt 
o 1 o 


bu^ bt Q 


+ 


bt bt 
o 


“i*i - ^ i2i ] at + r x f 

bt bt^ -I L 1 5t o ->t o 


-[- 


H + X. • — - 
i bt - 


] 


(2.2.10) 


Terms outside the integral may be evaluated at the endpoints 
indicated and terms under the integral sign combined to give 


_ * 
bj = 

bt bt 
o o 


+ H 


- X. 


+ \ . ! 

bYi 

- X. 

’ ^i 

t x 

t bt 

t i 

t„ bt 

t^ 1 

t bt 

o 

o 

o 

f o 

f 

o o 


fbH_ 1 Yl . fb[L _ — - ( "bH \ -j 

J + L vby. + bt / bt . bX 4 bt ' bt 'bn, / bt J 

t •> 1 o 1 oico 


(2.2.11) 


dt 
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The integral again vanishes identically for optimal paths. Using 
equations (2.1.26) and (2.1.30), the three terms outside the integral 
representing summations can also be equated to zero. With these 
observations, equation (2.2.11) reduces to . 





0 


' ( 2.2 


By tailing the derivative of J with respect to t f , following a 
line of -reasoning similar to that just given, and using equations 
(2.1.31) and (2.1.25)} it can be shown that 





0 


( 2.2 


These 'results are summarized in the following statement: 
Transversality Necessary Condition for Endpoints - If a trajectory 
satisfies the Euler -Lagrange and state variable differential equations, 
equations (1.1.10), (l.l.ll), and (1.1.3)} and if the set E = [y^ Q , 
t Q , t^, p. ] satisfies endpoint equations of constraint (1.1 A) and 
provides a local minimum of J with respect -to small allowable variations 
in the endpoints, then the set E must satisfy equations (2.2.6), 

(2.2.7)} (2.2.12) and (2.2.13). 

These latter equations are referred to as the endpoint necessary 
conditions or, classically, as the transversality necessary conditions. 


2-3 Derivation of Second Order Endpoint Conditions 

In the last section the function J was shown to be a function of 
the endpoint variables y^ Q , t Q , and t^ when evaluated along an 



optimal path. The function J is constrained, however, through the 
p equations of. constraint of equation (l.l,A). Second order 

Jo 

conditions for determining the minimum- of a function whose arguments 
must satisfy algebraic equations of constraint has recently been 
discussed. by Vincent and Cliff (1970, pp. 171-173). Their methods 
will be used here. For reference, a detailed discussion of the 
minimization of a function of several variables is included in 
Appendix A. 

Before presenting a statement of the second order condition, a 
brief discussion and definition of notation are in order. Since the 
algebraic equations of constraint for the Problem of Bolza' define 
relationships among the endpoint variables, the endpoint variables are 

o 

not all independent. Since there are p equations of constraint and 
(2n + 2) endpoint variables, there are only (2n - p +2) independent 
endpoint variables. The p dependent variables are determined by • 
the p- equations of constraint. Any p of the variables can be con- 
sidered to be- the dependent variables. The choice is. one of convenience 
Let the p dependent variables be denoted by the column vector w and 
the rema in ing (2n - p +2) independent variables be denoted by the 
column vector v . Let the vector jlr represent a vector whose elements 

are the & constraint functions. Equation (2-3-1) summarizes these 

JL 

relations . 


.^(w, v) 


V 1 


V 1 

i 2 (w, v) 


W 2 


y 2 

• 

w = 

■ 


• 

• 


• 


• 

• 


* 


• 

■ * 3 (w, v) 


w 

P 


V 

q. 


q = 2n - p + 2 



The identification of the elements of v and the elements of w 

with the endpoints y. , y . t } and t_ is arbitrary except that 

10 if _ o 1 

the set of ijf equations must contain every element of w and, in 
^ . 

addition, every 4 equation must contain at least one element of 
JL 

w. It is convenient to define an additional column vector r, whose 
first elements are the dependent variables and last elements the 
independent variables: 


w_ 


w 


v. 


[ 2 . 3 . 2 ) 


v- 


With these vectors define — ■ — as a (2n + 2.) by (2n + 2) matrix 

_ brbr - 


with elements a. . - 


bJ 


ID 6r.br. 
i D 


brbr J 

Let the matrix s be defined by 


* = -fM-i" r^Li 

L bw J L bv . J 


( 2 . 3 - 3 ) 


where 


r bt 

L bT J 


is a p by p matrix with elements 



r- b^ . 64 i 

is a p by q matrix with elements a. . = - — 

L bv J- - bv. • 

“ D 



and 


It is 


shown in Appendix A that the © matrix is the linear transformation 
which transforms differential changes in the independent variables 
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into differential changes in the dependent variables. The o matrix 
has p rows and q columns. Finally, define the (2n +2) by q parti- 
tioned matrix Q as 

n = [-|-] (2.3.4) 

where I represents a q by q identity matrix. 

With these definitions the second order endpoint condition may 
now be stated: 

Second Order Endpoint Condition . If E represents a set of endpoints 
and multipliers [y^ Q , y if j t Q , t f , p,^] which satisfy the transversality 
necessary condition for endpoints, and if the set E represents a local 
interior minimum of the function J with respect to small allowable 
variations in the endpoints then the quadratic form 

dv T n T 1 Q dv (2.3.5) 

*- 6rbr -* 

in the differentials dv is positive semi-definite when evaluated at 
the stationary point -E. 

To implement this test, it is necessary to evaluate the elements 

P . * _ 

of the matrix 9 and the elements of the matrix j _ — . Evaluation 

L hrbr -J 

of elements of § represents no problem since the functional form of 

the constraints is specified in the problem statement. However, the 

* 

analytic evaluation of the second partial derivatives of J with 
respect to the endpoints is not so simple. 

The second partial derivatives of J can be obtained by taking the 

J. 

Conversely if the transversality conditions hold and if (2.3*5) is 
positive definite then the set E is a local interior minimum (sufficiency). 



partial derivatives of the transversal ity necessary conditions with 
respect to. the endpoints r^_. 

(2-3-6) 
( 2 - 3 - 7 ) 
( 2 - 3 - 8 ) 
( 2 . 3 . 9 ) 

where in the above equations i = 2, ...» 2n + 2. The functional 

form of G as a function of the endpoints is specified by. the statement 
of the problem. However, the functions X. and H are not known functions 
of the endpoints until the state variable and Euler-Lagrange differen- 
tial equations have been integrated analytically. 

Since analytical integration is often difficult or impossible, 
it would be desirable to evaluate the partial derivatives of X^ and H 
with respect to the endpoints in terms of functional forms specified 
in the statement of the problem. A complete set of relationships of 
this type were not found. Unless future investigation establishes 
such relationships, analytic application of the second order condition 
requires an analytic solution of the state variable and Euler-Lagrange 
differential equations. 

Some interesting relations of this type are easily obtained however. 

* 

Each of the elements of the matrix | 1 is composed of a sum of 

L brbr J 


— ( 

bJ \ 

bG: 

br. v 
1 

* do 

br-by • 
x 

6 1 

fbJ* ) 

bG 
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\t 

'b'V 
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bG 
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br . bt 
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bG 


r~ ( b ) 

)r. d t 
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a- second term. The matrix can therefore . he expressed as the sum of 
two matrices. 



(2.3.10) 


where the matrix A is determined from equations (2. '3* 6) - (2.3-9) 

, ' ■ p ■ • • * 

Since J and G are' of class C by hypothesis, both J and G must 
be symmetric about their major diagonals. The obvious conclusion 
is that matrix A must also by symmetric. By equating symmetric elements 
of A, the follovring identities can be established: 


5X io 

»jo 

bX ±f 

b\if 

by. 

6y io 

V ' 

by if 

bH 

0 

»io 


b)i if 

11 

0 

•H 

£ 

bt 

0 

by lf 

‘ bt f 

bH f 


bH 

O 


l 

0 

"H 

& 

bt f 


bt o 

»if ' 

bljo 

bH f 

bH 
0 


6y if 

^7 

" . bt f 


(2.3.U) 


(2.3.12) 


(2.3.13) 


(2.3.14) 


In addition, the .following relations can be established by considering 
the functional relationships exhibited in section 2.1. 




b\ if 


’if. bt. 


(2.3.15) 


bH, 


5y if 


5x if + f 

bt f it by. f 


(2.3.16) 
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Similar equations exist for the initial point. 

Unfortunately, a sufficient number of these relationships have 
not been found- to determine the elements of A in terms of known 
functions in the problem statement. The determination of further 
relationships and the ultimate determination of the elements of A 
without resort to analytical integration of the state variable and 
Euler-Lagrange equations poses an interesting problem for future 
inve st igations . 


2.k Relation to Classical Theory 

Bolza (1961, pp. 102-103) gives an excellent summary of the 
various classical approaches to the development of necessary and 
sufficient- conditions for variable endpoint .problems. ■ Because of 
the pertinence of his remarks to this presentation, his historical 
synopsis is quoted in detail: 


Three essentially different methods have been proposed 
for the discussion of problems with variable endpoints: 

+ 

1. The metho.d of the Calculus of Variations proper : 

Z 2 

It consists in computing 6J and 6 J either by means of 
Taylor's formula or by the method of differentiation with 
respect to , ...and discussing the conditions 6 J=0 ... 

2. The met had of Differential Calculus : This method 

is explained in general way in Dienger ' s Grundriss des 
Variationsrechnung (1867). It decomposes the problem into 
two problems by first considering variations which leave 
the endpoints fixed, and then variations which vary the end- 
points, the neighboring curves considered being themselves 
extremals. . The second part, of the problem reduces to a . 
problem of the theory of ordinary maxima and minima. This 
method has been used by A. Mayer in an earlier paper on 

the second variations in the case of variable endpoints 
for the general type of integrals mentioned above ( Leipziger 
Berichte (1884), page 99)- 


^ . The first and second order variation of the integral are written as 

p 

6J and 6 J, respectively. Variations in the endpoints and in the 
path are considered simultaneously in this method. 
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It is superior to the first method not only on account of 
its greater simplicity and its more elementary character, - 
but because by utilizing the well-known sufficient condi- 
tions for ordinary maxima and minima it leads, in a certain 
sense, to sufficient conditions it combined with Weier- • 
.strauss’ s sufficient. conditions for the case of fixed end- 
points... . 

3. Kneser’s method : This method, which has been 

developed by. Kneser in his Lehrbuch^, is based upon an 
extension of certain well-known theorems on geodesics. . 

It leads in the simplest way to- sufficient conditions, 
but must be supplemented by one of the two preceding 
methods- for an exhaustive treatment of the necessary 
conditions. . . 


While Bolza (I96I, pp. 104-109) used method 2 for investigating 
the simplest classical problem with variable endpoints, and later 
Bliss (1932, pp. 261-266) used the same method for the classical 
problem of Bolza, more recent work, e.g. [Householder (1937* 
pp. 485-526) , .Bliss (pp. 147-184), and Hestenes ( 1966 , . pp . 296-351)] 
have utilized the first method quoted from Bolza. 

Sufficiency .conditions for. the problem of Bolza can be obtained 
by employing either method. However, the type of normality assumptions 
used differ from one method to the other, and the first approach 
apprently gained favor because it requires less stringent normality 
conditions. [As opposed to the second approach without modification, 
see for example, Bliss and Hestenes (1933> PP* 305-326) for a modi- 
fication of method 2] . 

In this presentation, the second method was employed because 
of its simplicity. We were not seeking a sufficiency condition for 
the problem of Bolza in control notation per se. Instead a less 
ambitious project was investigated. We sought conditions to be 
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Lehrbuch der Variationsrechnung Braunschweig (1900) . 



satisfied for a given extremal to "be a local minimum with respect to 
endpoint variations along a prescribed endpoint manifold. 


2.5 Geodetic Example 

As an example of the application of the second order condition for 
endpoints, consider- the problem of determining the minimum distance 
from the origin to any point on a parabola of the form 

y = x^ + b (2.5*1) 

In control notation the problem may be formulated as follows: 


Minimize 



o 

subject to the state- -variable differential constraint s,- 


( 2 . 5 . 2 ) 


= sin g, (2.5.^) 

ds 

and endpoint constraints, 

y o = 0 , (2.5.5) 

x = 0 , ( 2 . 5 . 6 ) 

o 

s = 0 , (2.5.7) 

o 

y f - - b = 0 . ( 2 . 5 . 8 ) 


The angle g is the angle between the positive x axis and a tangent 
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to the curve. Here x and y are the state variables, g is the control 
variable, and s is the independent variable analogous to t in the 
formulation of earlier sections. 


Necessary Path Conditions - The H and G functions are 


H = X cos g + X sin g - 1 
x & y 


(2.5-9) 


G = -b) + p 2 x q + p- 3 x q + P- i| s o 


(2.5.10) 


The ad joint- variable Euler-Lagrange equations are 


X = 0 

y 


X = 0 

x 


(2.5.11) 

(2.5.12) 


and the control -variable Euler-Lagrange equation is 


_ X sin g + X cos g = 0 

x & y 


(2.5.13) 


Equations (2. 5.1l) and (2.5.12) imply that X and X are constants, 
'Solving equation (2.5*13) for the control 


tan g = — ^ = constant 

X x 


(2. 5-1*0 


which implies 


sm g = 


/ 2,2 

V ^-x X y 


(2.5.15) 


cos g = 


n 


4 


(2.5.16) 
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The positive sign on the radical is a consequence of the Legendre- 
Clehsh necessary condition (1.2.17). 


Functional Relations - Integrating the state variable equations 
(2.5.3) and (2.5.4) with the optimal constant control g between the 
general initial- point (x , y , s-) and general final point (x„, s ) 
results in 


x „ - x = (s_ - s .) cos g - (2.5*17) 

f o ' f o & 

y f - y Q = (s f - s Q ) sin g (2.5.18) 


Solving for the control 

O 

tan g = 


• y f ' y o 


x -p " x 

I o 


Squaring both sides of equations (2.5*17) and (2.5*18) and adding 
yields the identity 


W - s o )2 * (x f • X o )2 + < y f - y o )2 


Solving equations (2.5*17) and (2.5*18) for the controls gives 


(2.5.19) 


( 2 . 5 . 20 ) 


and 


cos g = 


son g = 


X f ~ X o 
S f " S o 


y f - y Q 

S f “ S o 


(2.5.21) 


(2.5.22) 


Since the control is constant, the control is not a function of the 
independent variable in this case. For other problems the control may 


be a function of the independent variable as well as the endpoints. 

Integrating the state variable equations again between the general 



initial point (x q , y , s ) and general intermediate point (x, y, s) 
and substituting the optimal control from equations (2.5.21) and 
(2.5.22), and rearranging yiexas 


x 



y 


y o 




(2.5.23) 


(2.5.24) 


It is seen from the above equations that the state variables are clearly 
functions of coordinates of the initial and final state variables and 
of the initial and final values of the dependent variable. 

The first Integral of the Euler-Lagrange equations is 


\ cos g ’+' \ sin g - 1 = 0 

x -y 


(2/5.25) 


Solving this equation with equation (2.5*13) tor X and' X and 

t7 

observing equations (2.5>2l) and (2.5*22) gives 


x f ~ X o 
^x s„ - s_ 


X . - X 

f o 


J (x f - x 0 y 


+ (y f -y 0 ) £ 


(2.5.26) 


V 


y f : y o 

S f ’ S o 


y f - y o 


7 (x f -x Q ) 2 + (y f -y Q ) 


(2.5.27) 


Two forms are given above for the Lagrange multipliers as functions of 

* 

endpoints; either is correct. If the second set is used, the J function 
will be independent of s^, and s q . In either case it is clear that the 
Lagrange multipliers can be written as explicit functions of the coordinates 



of the initial and final values of the dependent variables. Equations 
(2.5.19), (2.5.23), (2.5.24), (2.5.26), and (2.5.27) hear out the 
functional dependencies hypothesized for control, state, and adjoint 
variables in section 2.1. Note that in deriving these equations, 
only path necessary conditions have been used. The transversality 
necessary conditions for endpoints have not been used. 


Necessary Endpoint Conditions - The transversality conditions, 
(2.2.6), (2.2.7)? (2.1.12), and (2.2.13) yield the following equations 


bJ 


a - 1 
2 yo 


= 0 


(2.5.28) 


* 


bJ 

bx 


H _ 1 
3 xo 


= 0 


(2.5.29) 


— =4, + H =0 
bs 4 o 
o 


(2.5.30) 


32- . * + * _ - 0 

by f 1 


(2.5.3.) 


^-- 2 Vf + X xf ■ 0 


(2.5.32) 


* 


|r " - cos g f - V sln g f + 1 = 0 

OSf 


(2.5.33) 


Since the initial point is fixed, the initial point transversality 
equations give no useful information. 

To find the optimal endpoints, eliminate between equations 
(2.5.31) and (2.5.32), yielding 


39 



0 


(2. 5-3*0 


**f " 2A yf x f = 

Substituting X xf and X f from equations (2-5-26) and (2-5-27) into. - 
equation (2-5.- 3*0 yields 


x f - x o 


2(y f - y 0 > x f 


u 


x f -x o ) + (y f -y 0 ) £ 


V (V X o )2 + (y f“ y o^ 


= . 0 


(2-5-35) 


Finally, multiplying through by the radical and imposing endpoint 
constraints (2-5-5) and (2-5-6) gives 

x f (l + 2y f ) = 0 (2-5-36) 


The necessary conditions are satisfied if either term in the above 
equation is equal to zero. Solving equation (2-5-36) and equation 
(2-5-8) simultaneously gives the two solutions 


— • ± 


7- 


1 

2 


- t y- = 


- ~ (solution A) 


and 


(2-5-37) 


x „ — 0 


y = b (solution B) 


These endpoints and the corresponding multiple solutions for 
b < - ~ are shown in 'Figure 1-3 on page 11. From the symmetry of 
the parabola, it is expected that either the plus or the minus sign 
in equation (2-5-37) will determine a solution giving the same value 
of distance. For this reason a distinction has not been made between 
the two. The necessary conditions used so far have provided no 
means for determining under what circumstances solution A (or solution B) 
is the optimum. In this case of multiple stationary solutions, the 
endpoint second order condition will provide a means for determining 



the true optimum. 

Before examining the second order conditions} the paramenter 
will he evaluated in terms of the general endpoints for future 
reference. From equations (2.5*27) and (2.5*3l) it is observed that 


{i = - X = _ 

l yf 


y f " y o 


J (x f -x o f + (y f -y D )‘ 


(2.5.38) 


Second Order Endpoint Condition - To evaluate the endpoint 
sufficiency conditions} it is instructive to first determine the 
5 'and q matrices of equations ( 2 . 3 * 3 ) and (2.3*4) • The constraints 
are 


♦ 

1 : 


y_ 


= 0 



x = 0 

o 


4 

3: 



s 


o 


0 



0 


(2.5.39) 

(2.5.40) 

(2.5.41) 

(2.5.42) 


Since there are four equations and six endpoints, there are two 
degrees of freedom. For conveniency let x f and s f be the independent 
variables and y ^ 5 x qJ 5 and y^. be the dependent variables. Then in 
the notation of section 2.3 



(2.5.43) 
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Evaluating the matrix of partial derivatives of Jf with respect to 
the independent variables gives 



0 


Evaluating the matrix -of partial derivatives of jf with respect to 
the dependent variables gives just the identity matrix 


bjr_ 

bw 


10 0 0 
0 10 0 
0 0 10 
0 0 0 1 


The inverse of this matrix is obviously the identity matrix. From 
equations (2.5.44) and (2.4.45) the ® matrix can be computed 


r "| 

bi 

L bw 

L tw J 


0 . ■ 0 
0 0 

0 0 

2x 0 


The 0 matrix is formed by adjoining the I an identity matrix with the 
dimensions equal to the number of independent variables. In this case 
there are two independent variables. The fi matrix is 


Q = 



0 

0 

0 

2x f 

1 

0 


0 

0 

0 

0 

0 

1 


With the 


. * 
bJ 

brbr 




not yet 


evaluated, the 


endpoint sufficiency condition 


.5.44) 


.5.45) 


2.5.46) 


-5.47) 



reduces to the condition that 


4x 


2 b 2 j* 


by f 


2x, 


.2 * 
b J 




[dx f ds f ] 


.2 * 

4x f 

^t bx t 

,2 * 

b J 

bx 2 


2 * 

2x,<^— + 


f - &s f i by f 

.2* 

b J 


bs € bx f 


.2 * 
b J 

bx f b Sf 


b 2 J* 
bs 2 


dx. 


ds^ 


(2.5-48) 


must be positive definite. 

If J can be written so that is is not a function of s f , the 
sufficiency condition will be reduced to a simple inequality involving 
dx f only. From the transversality equations (2.5-31) - (2-5*33) and 
the functional relations for l x and Ay 5 equations ( 2 . 5 * 26 ) and (2.5*27): 
it is seen that this can be done . 

Therefore j the sufficiency condition reduces to the condition that 


r 

L 


4x. 


2 * 
b J 

tt 


+ 4x. 


.2 * 

b J 


x 2 * 

b j 

bxo 2 


] 


dx. 


> 


(2.5*49) 


This result is identical to the result that would have been- 
obtained if the fixed endpoint coordinate s, y , x Q> and s q had been 
excluded from the G function. This situation is similar to the trans- 
versality necessary conditions in that the initial points yield no 



information. From this example and previous experience with endpoint 
conditions, the following conclusion is drawn: 'ho useful information 
concerning, either necessary or sufficient conditions results from 
including in G constraints which merely fix a given endpoint coordinate. 

Substituting the functional forms for X^ and X^ not involving 
s^ fran equations (- 2 . 5 * 26 ) and (2.5*27)- into the first partial 

* / X 

derivatives of J with respect to y and x f in equations (2.5.-31) and 
(2.5*32) gives 



M-i + 



" y o 

+ (y f -y Q ) 


2 


(2.5.51) 



-2p 1 x f . + 


X - X 

f o 


■ J (x f -x o ) 2 + (y f -y 0 )‘ 


Forming the required partial derivatives results in 


2 * 
5 J 

bx 2 
f 


~ 2p, n + 


(x f -x o ) 2 + (y f -y Q ) 2 --x f (x f -x o ) 


D 


,2 * 

b J 

by f bx f 


(x f -x o ) (y f -y 0 ) 
D 


il 

by ^ 2 


( V x o )2 + (y f- y o )2 “ y f (y f- y o ) 


D 


2 * 

5 J 

bx f by f 


(y f -y Q ) (x f -x o ) 
D 


where 


D =■ [(x f -x Q ) 2 + (y f -y Q ) 2 ]2 


(2.5*52) 


(2.5*53) 


12 . 5*5 


fp. S.S5) 


(2.5.56) 


(2.5.57) 
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Comparing equations ( 2 . 5 * 5 ^) and. ( 2 . 5 * 56 ) verifies the symmetry of the 
2 * 

I — — 1 matrix. 
h‘ hr* hr -J 


Evaluating these derivatives using initial point constraint 
equations (2.5.5) and (2.5*6) and u from equation (2.5*38) and 
substituting them into the endpoint sufficiency condition, equation 
( 2 . 5 - 50 ) gives 


4 x 4 
' X f 

— - 3 / 

< f - > 


4 x 


r x i^f 

f L 7 2~ 2,3/2 

(x f + y f ) ' 


1 


+ 2 


Vx/ + 


y-f 


(2.5.58) 


+ 


7 2 " 2 >. 3/2 

(x- + y . V 


> 0 


for solution B (x^ = 0 , = h) this condition reduces to 


£ + 2 > 0 
b 

From the geometry in Figure 1 . 3 5 it can be seen that b is negative. 
The condition therefore requires that 


( 2 . 5 * 59 ) 


0 > b > -2 


(2.5*60) 


Then solution B as shown in Figure 1.3 is optimum. 


For solution A (x f = - 


+ /' 


V- 


1 


- b 


? ^"f " 2 ^ ^ 


the end sufficiency condition (2-5*58) becomes 


45 



4b 2 + 2b + - 


■T 


> o 


- b 


(2 


Combining terms yields 


4b 2 + *3b + | 


(-r - b) 


W 


> 0 


(2 


In order for the denominator to be real 


b < - 


(2 


Under this condition inequality (2.5*62) is satisfied only if 


b < . 


(2 


Therefore solution A shown in Figure 1.3 is optimum for b less than 
1 

- — . The optimal solution is summarized below. 


X f = °> y f = b 


0 > b > 


(2 




- b » y. 


f 2 


b < - 


(2 


This simple example has been analyzed in great detail to emphasize 
the concepts developed in earlier sections and to reinforce and illus- 
trate the notation. 


2.6 A Numerical Algorithm 

In order to apply the second order endpoint condition, the matrix of 

* 

second partial derivatives of J with respect to its arguments must be 


.5.61) 

•5.62) 

.5.63) 

•5.64) 

.5.65) 

.5.66) 
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determined. From equations (2.3*6) - (2*3*9) it is seen that each of 
these second partial derivatives .is composed of two terms. The first 
terms, in all cases, is a second partial derivative of the function G. 
This derivative can he computed analytically from information given 
in the statement of the problem. The second term of each second 
partial derivative of J can he written in one of the following 
forms: 


EM 


t=t 


f 


5 


E>r . 

ro 


Em 


t=t 


6r if 


y 


or 




y 



E>r. 

xo 


y 


where M represents any of the quantities H, 1^, ... , and r 

represents any of the state variables y^ or the independent variable t. 

These derivatives cannot be evaluated analytically without 
obtaining an analytic solution to the set of state variable and 
Euler-Lagrange differential equations. For most problems of practical 


( 2 . 6 . 1 ) 


( 2 . 6 . 2 ) 


(2.6.3) 


(2.6.4) 
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interest in the calculus of variations} the set of nonlinear state 
variable and Euler -Lagrange differential equations cannot he inte- 
grated analytically. Therefore, the implementation of the endpoint 
sufficiency- condition in most cases requires the numerical computation 
of partial derivatives of the forms expressed in equations (2.6.1) - 

(2.6.4) . 

Fortunately, this is not conceptually difficult for most problems 
in engineering which have separated end constraints. End constraints 
are separated if none of the endpoint constraints involves both 
initial values and final values; the constraints always related initial 
values to other initial values, or final values to other final values. 

The function M evaluated at t = t will be indicated by a subscript o: 


M o = M b (y io’ y if’ V V 


(2.6.5) 


The function If evaluated at t = t will be indicated by a subscript f : 

i 


M f - V y io’ y if’ V V 


Before the second order condition test is applied, the problem is 
first solved using the necessary conditions yielding nominal endpoints 

# 3$. x- * ■X' 

V j y. _ , t , and t„ and nominal Lagrange multipliers X and X. 
lO if o f lO J. 


1± 


* . . * * 
For brevity, let r^ represent a vector with elements (y-j Q , Yp a > 


y no 5 o 


* 

^f 


tj, and r_p represent a vector with elements (y lf ., y pf o 
. * 


'lo 5 2o" 
* * 

^lf 5 


* 

^nt 


tp , and M be a function evaluated with the nominal endpoints 
Numerically the derivative (2.6.1) can be approximated as 


tM 


f 


. * * * *v * 

M P r lo’ r 2o’ r io + -P ~ M f 


6r . 

10 


(2.6.7) 
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where A is a small change in the nominal initial variable r^ Q . If 
the state variable and Euler -Lagrange equations are then numerically 
integrated forward with the nominal Lagrange multipliers, the final 
nominal endpoint will not be reached. The n initial Lagrange multi- 
pliers must be adjusted in order to obtain the final nominal endpoint 
again. Since the n initial Lagrange multipliers give only n degrees 
of freedom, the nominal endpoint can be reached only if M is a function 
of n or less then n independent final values. This will be true if 
there is at least one equation of constraint involving the final values. 
With these new multipliers, the differential equations are integrated 
forward to the final point r^. M_,, is then evaluated from the resulting 

final Lagrange multipliers and rm,. With evaluated, the desired 
partial derivative can be evaluated using equation (2.6.7)* 

The derivative (2.6.2) can be approximated numerically' as 


o 



M (r*; 
o—o 


* * 
r lf 5 r 2f’ 




( 2 . 6 . 8 ) 


In the above equation, M q is evaluated by making a small change in 

Y _ 

r^, while leaving all the other values unchanged.. A set of 'final 

Lagrange multipliers is then determined so that a backward numerical 

* 

integration in time will yield the nominal initial values r Q . The 

quantity M q is evaluated using the resulting initial Lagrange multi- 
•3C - 

pliers and r^. With M q computed in this manner, the desired partial 
derivative can be evaluated using equation (2.6.8). 

The derivative (2.7.3) can be approximated numerically as 



/ * 

* 

* 

* 


r lf’ 

r 2f 5 ‘ 

. . , r . ^ + 

1-L 


A 



(2.6.9) 


h9 



Here, is evaluated by making a small change in the nominal final 

* 

point coordinate r^, while leaving all of the other final coordinates 

* 

and the initial point unchanged. A set of initial Lagrarge 

multipliers is then determined so that a forward integration from the 

nominal initial point will yield the varied final point (r^ , r^f , . . . , 

. 

r^ + A, The forward integration is then performed to the 

varied final point, and is evaluated using the resulting final 
Lagrange multipliers and the coordinates of the varied final point. 

The final derivative (2 .6.U) can be approximated numerically as 


hM 

o 



m ( 


* 


'lo 5 2o ; 


* 

* r io + 




) - M 


( 2 . 6 . 10 ) 


Here, is evaluated by making a small change in the nominal initial 
* 

point r. , while leaving all of the other initial coordinates and the 
xo 

* 

final point r^ unchanged. A set of final Lagrange multipliers is then 
determined so that a backward integration in time from the nominal 

•- * * * 45 - 

final point will yield the varied initial point (r^ Q , r ^ 0 , ..., r^ Q 

+ A, ...). The backward integration is then performed to the varied 

initial point, and M q is evaluated using the resulting initial Lagrange 

multipliers and the coordinates of the varied initial point. 

Using the above techniques, the matrix of second partial derivatives 
* 

of J with respect to its arguments can be evaluated. Because of the 

identities (2.3.H) - (2.3*1^) , there is some choice as to which of 

the above derivatives is used to evaluate the sufficiency condition. 

It is a simple matter to numerically evaluate the matrix Q from the 

o * 

T o"J 

nominal initial and final points and to test the matrix Q 0 


hr hr 


for positive-definiteness. 
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CONCLUSIONS 


It has "been shown that once the first necessary path conditions 
have been applied, a calculus of variations problem with variable 
endpoints is reduced to a problem of the minimization of a function 
of several variables. 

Analytical application of the second order endpoint condition 
requires the analytical integration of the set of state variable and 
Euler-Lagrange differential equations. Since in most cases this is 
difficult or impossible, the algorithm developed for the numerical 
implementation of the second order endpoint test should be an 
effective computational tool in complex applications. For example, 
through the use of the second order endpoint test, a complete class 
of nonoptimal solutions can be discarded immediately upon encounter. 
Without the aid of the second order endpoint test an investigator . . 
would have no indication that solutions he is generating are non- 
optimal whether he encounters multiple solutions or not. 

It could be argued, when multiple stationary solutions are obtained, 
that a comparison of solutions would quickly yield which one was 
optimal. However such a comparison technique fails if it is not known 
apriori exactly how many multiple solutions exist. One has no criteria 
in general' for determining in advance just how many multiple stationary 
solutions a problem may have, so that a direct comparison technique is 
unreliable unless every multiple solution is somehow found. 


51 



APPENDIX A 


. . MINIMI ZATION OF A FUNCTION OF SEVERAL VARIABLES 

In the proof that follows, free use will he made of the notation 
and conventions established at the beginning of section 2.3* Following 
the methods of Vincent and Cliff (1970) , consider the problem of 
minimizing a function' of several variables 

J = J(w, v) (A.l) 

subject to the constraints 

4 (wj t) = C) 

p 

where both J and are functions of class C and the constraints are 
such that the determinant of the Jacobian 

[|] 

is nonsingular. The dimension of jjr and w is assumed to be p and the 
dimension of v is assumed to be q.. 

A.l Method of Implicit Functions 

Since Jr is of C 2 and condition (A.3) has been postulated, the 
implicit function theorem (Buck, 1965 a PP- 283-286) states that equation 
(A. 2) implicitly assures the existence of the vector function W explicitly 
relating the dependent variables w to the independent variables v 

w = W(v) = 


W^v) 

W 2 (v) 


w p (v) 


(A.4) 


(A.2) 


(A.3) 
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By substituting (A. 4) into (A.l), J becomes a function of v only 

j = ' . j(w(v) , v) 

Define the general value of independent variables v in a small 
neighborhood of an optional point v° 

o 

v = v + e c 

where c is a vector of arbitrarily chosen small, but non- zero, 
constants. and e is a scalar multiplier. Then, from (A.l) and (A. 2) 

J = J [W(v° + e ^) , v° + e c] 

'!( [W( v° + e c) , v° + e c] = 0 

Now J is a function of e only, and the necessary condition for 
an ordinary local extremum is 

T T 

dJ _ bJ' h + frJ c ~ o 

de bw ” b\r — 

bw. 

where h is the vector with elements h. = — - . The vector h 

1 be 

represents changes in the dependent' variables w corresponding to 
the changes _c in the independent variables. Differentiating equation 
(A. 8) with respect to e yields 



(A.5) 

(A. 6) 

(A. 7) 
(A.8) 

(A.9) 


(A. 10) 


Solving for h yields 



Substituting (A.ll) into ( A-9) and. rearranging giver 


dJ 

de 


bJ 

bv 


.T 


‘ T -1 

bJTb'lr 
bw 


r«i mi i £ - o 

L tw J L b» J J 


Since c is an arbitrary nonzero vector, each of the elements of the 
vector in parenthesis must be equal to 0 at the optimal point. 



= 0 


If equation (A. 13) is satisfied, then a further necessary condition 
for a local interior minimum is that 



v 

o 


must be positive semi -definite for arbitrary values of h and _c 
satisfying equation (A.ll). 

Before evaluating this expression, an identity for taking the 
partial derivative of the inverse of a matrix must be developed. 


Let A. . represent a general element of the 



matrix: 


a.. = rin" 1 

- Then in indieial notation the definition of inverse may be 
expressed as 

V - Vi 

m 


^ Equation (a. 13) and (A.lU) positive definite is sufficient for a 
local interior minimum. 


(A. 12) 


(A.13) 


(A. 14) 


(A.15) 


(A.16) 



where 6 . is the Kroniker delta. Premultiplying hy A. gives 
<3.0 1( 1 


Nr 

A. . = A. — ^ A . 

ij i<3 bw ®-0 

m 


Taking the partial derivative of both sides yields 


bit 


(A..) = — (A. ) 5 . + A. — l — * ) A . 

br !0 br X< 1 13 *1 br V bw / 

n n n • m 


+ 6. (A .) 

xm br^ mj 


Since 8-.. represents a constant, equation (A.l8) reduces to 
**-J 

~ U.J = ■£- (A ) 4- A -2- ( -^ ) A. + T 5 - UjJ 

br 3-0 br 3.J iq br v bw mq br 2.3 

n n n m u n 


which gives the desired identity: 


*-— * (A. .) = 

br n xo 


-A- ■£ 

iq br 


s M \ 

(t-0 


n bw 


A . 

mg 


m . 


From this point- on results must be expressed in indicial notation 


bit 


since 


br 


n 


f M. \ 

( — — j is a tensor. In indicial notation equation (A-12) 
c m 


becomes 


dJ 

de 


■ [ 


bj bd 


bit 


A. . — - c 

bv. bw. 3-0 bv. J 

.K. . 1 * K 


k 


Using equations (A.20) and (A.2l) the second order condition (A.l4) 
becomes 


(A, 17) 


(A. 18) 


(A. 19) 


(A.20) 


(A. 21) 


0 
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a 2 j 

de 2 


b 2 J . b 2 J 

bw . bv, ^ i bv bv, k n 


i k 


n k 


b 2 J 

bw bw. ’"'ij -by, ^k n bv bw. bv k n 


bt 1 

A . . — “■ c, h 


A. . C.O. 


n i 


n i 


k 


bJ 


A. 


b\ 


bit. 


A . 


c, k 


bw . ib bw bw i&D bv, k n 


n m 


(A. 22) 


2 

bj b tjf- Mr.: 

T" A . - M . A . -—I c,c 

bw, iq bv bw mj bv, k n 
i n m k 


bJ 


A. . 


2 

b^ 


c, h 


bw. ij bw bv, k p 
i p k 


bJ 


A. . 


2 

6 


x , .n. . C, C 

bw. xj bv bv k n 
1 n k 


The indie ial notation representation of equation (A.ll) 


h 


m 




. (a. 23) 


appears in four terms of equation (A. 22) . Regrouping terms, (A. 23) 
can be written 


d 2 J 

de 2 


■ (■ 


bJ 


2 

b *. 



bv. 

1 

A. . 

13 

bv bv. 
n k 

b 2 J 

bJ 

A . 

2 

b 4 . 
0. ' 


1 k 


i k 


) 


c, c 
k n 


e, h. 
k 1 


(A. 24) 
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(A. 24) 


x 2_ 
f b J 

*.bv bw . 
n 1 


■ x 2 

f b J 

'.bw'.bw. 
3 i 


bJ 


A 




bw pq by- bw. 
p n i 


h.c 

J in 


2 

x b ii n 

M- a — q 

bw PI bw 
P i 3 


VS- 

h.h. 
.bw J i 3 


Equation (A. 24) and equation (A. 13) provide a set of first and 
second order conditions for j(w> v) to be minimum subject to the 
constraints £(w, v) = 0. 

A. 2 Method .of Lagrange Multipliers 

The first and second order conditions can be put in a form which 
is more convenient to«use by defining the augmented function 

-X' 

J (W) Vj p) = d(wj v) + j£ jk( w, v) 


(A. 25) 


where P is a vector of constant multipliers called Lagrange multipliers. 
If the Lagrange multipliers are given by the identity 


bJ 

p . = A 

3 bw. lj 

a ° 


or in matrix notation 


(A. 26) 


it = “ 


bj T r b'lt -~ 1 
bw 


r of -j 

L J 


(A.27) 


several observations can be made. Thus necessary conditions for 1 
to be a minimum (A. 13) become 


bJ 

bv 


= 0 


(A. 28) 
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The second order conditions (A. 24) may he written as 


+ 


C 

(' 


b 2 j 


2 

b *i\ . 


+ 

? 

6 *3 ^ 

Ov bv. 

^3 

bv bv. ) C k C n + 

Vbw. bv. 

bw . bv, / 

n k 

t 

n k 

x k 


x k 

2 

b J 

»3 

.2 

6 *3 \ . 

/ b 2 J 

+ ^ 

6 % 

bv bw, + 

bv bw./ in 

Vbh.bh. 

bw . bw . / 

n k 


n i 

3 i 


i 3 


c, h. 
k x 


h.h. 
i 3 


Define the vectors r and d as 


W 1 


h 

W 2 


h 2 

• 



• 


• 

• 


• 

w 

d = 

h 

P 


P 

V 1 


C 1 

V 2 


C 2 

• 



♦ 


• 

• 


• 

V 


c 

<1 


1 


(A.29) 


(A. 30) 


The second order condition (A. 29) may now he put in compact matrix 
notation hy using equation (A. 25) and definitions (A. 30): 



(A.31) 


To guarantee that _c and h in vector d satisfy equation (A.ll), elements 
of d may he expressed as functions of the independent constants _c only. 
In matrix notation this may he expressed as 


d = 



(A.32) 
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where 



(A. 33) 


and. I is a q by q indentity matrix. The second order condition (A. 32) 
may then be written as 



(A.34) 


The advantage of the Lagrange multiplier technique is that the 
first and second order conditions can be expressed in a compact 
matrix notation. 
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